Model Selection

XLSR-53 Fine-tuning

# XLSR-53 Fine-tuning

Exp W2v2t Ja Xlsr 53 S109

Japanese automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained using Common Voice 7.0 Japanese dataset

Speech Recognition

Transformers Japanese

Ai Light Dance Singing Ft Wav2vec2 Large Xlsr 53 5gram V1

This model is an automatic speech recognition model based on wav2vec2-large-xlsr-53, fine-tuned on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING dataset, primarily used for singing voice recognition.

Speech Recognition

Wav2vec2 Common Voice Tr Demo Dist

This model is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE - TR Turkish dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 0.3242 on the evaluation set.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Japanese

Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampling rate audio input

Speech Recognition Japanese

Wav2vec2 Luganda

A Luganda automatic speech recognition system fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, achieving 7.53% WER on the Common Voice Luganda dataset.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Arabic

A Wav2Vec2-Large-XLSR-53 model fine-tuned for Arabic speech recognition, trained on the Common Voice and Arabic Speech Corpus datasets

Speech Recognition Arabic

Wav2vec2 Common Voice Tr Demo

This is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE - TR Turkish dataset based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Gu

Gujarati automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving 23.55% WER on OpenSLR dataset

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Chinese Zh Cn

A Chinese speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampling rate audio input.

Speech Recognition Chinese

Wav2vec2 Large Xlsr Bengali

A Bengali automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained with 40,000 speech samples from the OpenSLR dataset

Speech Recognition Other

Wav2vec2 Hausa2 Demo Colab

This model is a Hausa speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase